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Pythagoras Theorem in Noncommutative Geometry 

Francesco D ’Andrea 


Abstract. After a review of the results in [11] about Pythagorean inequalities 
for products of spectral triples, I will present some new results and discuss 
classes of spectral triples and states for which equality holds. 


1. Introduction 

The Pythagorean Theorem (Euclid 1.47) is probably the most famous statement 
in all of mathematics, and also the one with the largest number of proofs. There 
is a collection of 366 different proofs is in the book by E.S. Loomis [20], and 
a track online of some false proofs [1], which includes some from Loomis’ book 
itself. A celebrated visual proof is the one in Figure la, which Proclus (ca. 412-485 
AD) attributes to Pythagoras himself [ 21 , p. 61]. A similar one [ 20 , Proof 36] is 
considered the first ever recorded proof of the Pythagorean Theorem [ 21 , Chap. 5]. 
A chronology can be found in [21, p. 241]. 

There are generalizations to hyperbolic and elliptic geometry as well. For a 
right-angled geodesic triangle in the hyperbolic plane (Fig. lb): 

cosh(a) = cosh(fe) cosh(c) , 
see e.g. [ 34 , p. 81], while on a unit sphere: 

cos(a) = cos(6) cos(c) . 

In both cases, if one performs a Taylor series expansion, at the leading order one 
retrieves the usual Pythagoras theorem, which then holds for “very small triangles”. 

Pythagoras theorem for “very small triangles” is exactly the way one defines the 
product metric in Riemannian geometry. On a Cartesian product M = M\ x M 2 
of two Riemannian manifolds, one defines the line element as 

ds 2 = ds\ + ds\ (1.1) 
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(a) The blue area equals the total 
area minus four times the red area, so: 


(b) A hyperbolic right-angled triangle 


a 2 = (b + c) 2 - 4(6c/2) = b 2 + c 2 . 


Figure 1 


where ds\ resp. ds 2 are the line elements on Mi resp. M 2 . As shown in [ 11 ], one 
can integrate last equality, and prove that Pythagoras equality holds for geodesic 
right-angled triangles, provided the two legs are “parallel” one to Mi and the 
other to M 2 (cf. §5.1). The first complications arise in this example when points 
are replaced by probability measures, and the geodesic distance is replaced by its 
natural generalization: the Wasserstein distance of order 1 (see e.g. [ 36 ]). One can 
realize via a simple example that Pythagoras equality is replaced by the inequalities 


\Jb 2 + c 2 < a < \fi \Jb 2 + c 2 , 


( 1 . 2 ) 


and that the upper bound - with coefficient %/2 - is optimal (in §3.3 of [ 11 ] we 


exhibit an example where a assumes all values between the lower and upper bound). 

Now, (1.2) is a particular instance of a formula which holds in noncommutative 
geometry. Given a C*-algebra A, we can define a distance on its state space 5(A) 
by means of a spectral triple. Suppose A = Ai <g> A2 is a tensor product of two 
C*-algebras. Identifying elements ip = (ipi,p 2 ) and ip = (ip 1 ,^ 2 ) £ 5(A0 x <S(A 2 ) 
with product states of A, one may wonder whether Pythagoras equality holds for the 
triangle with vertices ip, ip and (ipi, P2) (Fig. 2). It turns out that (1.2) is valid even 
in this more general framework [ 11 ]. More precisely, the right inequality is valid 
for arbitrary spectral triples, while the left one is valid for unital spectral triples 
only, but it holds for arbitrary states (with components Pi,p 2 ,ipi,ip 2 replaced by 
marginals). I will present a slightly different proof of the inequalities (1.2) for 
spectral triples in §3 (including the generalization to marginals). 
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It is then natural to wonder for which states and spectral triples the left inequal¬ 
ity in (1.2) is actually an equality, that is: Pythagoras theorem holds. In the exam¬ 
ple M = Mi x M 2 of Riemannian manifolds, it holds for geodesic triangles, hence 
when the vertices are pure states of the commutative algebra Co (Mi )0 Cq {M 2 ) [11]. 

In the case of pure states, the equality was proved for the product of any Rie¬ 
mannian manifold with a two-point space [ 23 ] , and for the product of Moyal plane 
and a two-point space [22] under the hypothesis that the two states on Moyal plane 
are obtained one from the other by a translation (so, even in this non-commutative 
example, there is a geodesic flow connecting the vertices of the triangle). 

In §5.3, I will discuss the discrete analogue of the example of Riemannian 
manifolds: I will prove that the equality holds for pure states in the product of 
canonical spectral triples associated to two arbitrary finite metric spaces. One 
interest for this proof is that it is purely algebraic and makes no use of geodesics. 
In §6, I will generalize the result in [ 23 , 22 ] to the product of an arbitrary spectral 
triple with the two-point space, under the hypothesis that the states considered are 
connected by some generalized geodesic; I will then explain how to extend such a 
result from the two-point space to an arbitrary finite metric space. As a corollary, 
we get Pythagoras equality for arbitrary pure states in the product of a Riemannian 
manifold and a finite metric space, thus completing the “commutative” picture, and 
the product of Moyal plane with a finite metric space. 

At this point one could believe that Pythagoras equality is a property of pure 
states, but there is at least one example where it holds for arbitrary product states: 
it is the product of two two-point spaces (the simplest example conceivable), as I 
will explain in §5.2. 

A much more difficult question is whether the equality holds for pure states in 
the product of arbitrary (unital 1 ) spectral triples. In fact, pure states of a commu¬ 
tative C*-algebra are characters. At the present time, there is no general proof nor 
counterexamples (even noncommutative) to Pythagoras equality for pure/character 
states. 

1 1n the non-unital case, sometimes even the left inequality in (1.2) is violated by pure states. 
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The paper is organized as follows: there is a first part discussing the general 
theory (§2 and §3), a second part where I collected examples (from §4 to §6), and 
a section (§7) where I highlight some analogies between the spectral distance and 
the purified distance of quantum information. 

2. Some preliminary definitions 

2.1. Cartesian products and the product metric. If X is a set, we call 
d : X x X -A [0, Too] an extended semi-metric if, for all x,y, z € X: 

i) d(x, y) = d{y, x) (symmetry); 

ii) d(x,x) = 0 (reflexivity); 

iii) d(x, y) < d(x, z) + d(z,y) (triangle inequality). 

If in addition 

iv) d(x, y) =0 => x = y (identity of the indiscernibles), 

we call d an extended metric [12]. It is a metric ‘tout court’ if d{x,y) < oo V x,y. 
For p > 0, we denote by 

IMI P = (M p + M p + --- + Kr) ? 

the p-norrn of v = (iq,..., v n ) € R n , and by ||v||oo = max,; |iq| the sup (max) norm. 

Given two metric space (Xi,di) and (X 2l d 2 ), one can verify that the formula 

d(x,y) := || ( di(xi,yi), d 2 (x 2 , y 2 )) || p , V x = (x\,x 2 ),y = {yi,y 2 ) G X ± x X 2 , 

defines a metric on the Cartesian product X = X\ x X 2 , for any fixed p G (0, oo]. 
For p = 2, we call this distance the product metric and denote it by d\ Md 2 . 

More generally (for any p), if d\ and d 2 are extended metrics, then d is an 
extended metric, and if they are extended semi-metrics, so is d? 

2.2. Forms and states on a C'-algebras. A form on a complex vector 
space A is a linear map p : A —} C. Given a form ip : A —> C on a unital algebra A , 
I will denote by pfi : A —>• A the map ip^a) = ip(a)lA■ Note that if p{\ a) = 1, then 
p^ 1 is an idempotent. Here by idempotent I mean an endomorphism 7r of a vector 
space satisfying n o ir = tt 1 while I will reserve the term projection for bounded 
operators p on a Hilbert space satisfying p 2 = p* = p. 

A state on a complex C*-algebra A is a form which is positive and normalized: 
<p{a*a) > 0 V a G A , ||<£|| = 1 , 

2 The non-trivial part is to prove that d satisfies the triangle inequality. For x. y, z S X, 
let u,v,w £ K 2 be the vectors u = (di(o:i, yi), d 2 {x 2 , y 2 )), v = (di (xi, zi), d 2 (x 2 , z 2 )), and 
w = (di (zi, yi), d 2 (z 2 , J/ 2 )) - Then dix. y) = ||u|| p < ||a-)-«’||p (since the norm is a non decreasing 
function of the components, and d\,d 2 satisfy the triangle inequality). From the norm inequality 
||ti + tu||p < ||u||p + 11to11p = d(x. z) + d(z. y) we get then the triangle inequality for d. 
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where ||y>|| is the norm dual to the C*-norm of A: 


IMI := 


sup 

a£ A,a^O 


Mq)II 

IMI 


If A is unital, the normalization condition is equivalent to p{1a) = 1- 

States of A form a convex set 5(A). Its extremal points - states that cannot 
be written as a convex combination of other (two or more) states - are called 
pure. Pure states x of C'o(M), with M a locally compact Hausdorff space, are 
identified with points via the formula x(f) := f(x), and are in fact characters (one 
dimensional representations) of the algebra Co(M). 

If A is a concrete C*-algebra of bounded operators on a Hilbert space H, a 
normal state p is one of the form 


ip(a) = Tr n(pa) , V a G A, 


where p is a positive trace class operator normalized to 1, called a density operator 
or density matrix for p (and in general is not unique). For an abstract C*-algebra, 
we can talk about normal states with respect to a given representation (cf. §2.4.3 
and Def. 2.4.25 of [3]). 

If A is a finite-dimensional C*-algebra every state is normal; if A = M n ( C) with 
its natural representation on C" the correspondence between density matrices and 
states is a bijection; this is true even if A is the C*-algebra of compact operators 
on a infinite-dimensional separable Hilbert space, with the weak* topology induced 
by the trace norm on density matrices (Prop. 2.6.13 and Prop. 2.6.15 of [3]). 

For a discussion of states from the point of view of quantum physics one can 
also see [18]. On the probabilistic interpretation of states, and relation between 
quantum mechanics and probability theory, one can see [29, App. A]. 


2.3. States of a bipartite system. Let A = A\ ® A 2 be the minimal tensor 
product of two unital C*-algebras A\ and A 2 . Given a state p on A, we define its 
marginals as the states p\ G 5(Ai) and p\ e 5(A 2 ) given by 3 

<Pi(oi) = ¥>(ai ® 1) , ^ 2 ( 02 ) = ¥>(1 ® a 2 ) , 

for all ai G A\ and a 2 G A 2 . 

We call ip a product state if it is the tensor product of its marginals 4 , i.e. if 
it is of the form p = p\ ® p 2 for some pi G 5(Ai) and p 2 G <S(A 2 ) (note that 
the latter condition makes sense in the non-unital case too). The set of product 
states will be identified with the Cartesian product 5(Ai) x 5(A 2 ) via the bijection 
Pi < 8 >P 2 | —> {pi,P 2 ), and more generally there is a surjective (but not injective) map 

5(A) -» 5(Ai) x 5(A 2 ) , p 1 y (p\, ^ 2 ) ■ (2A) 


3 To simplify the discussion, I give here the definition only in the unital case. 
4 Here I adopt the terminology of [2, 29] (slightly different from the one of [10]). 
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A state is called separable if it is a convex combination of product states. In fact, 
for infinite-dimensional algebras, we may want to consider the closed convex hull 
of product states, and call p £ 5(A) separable if it is of the form 

P = p n Tl,n ® T2,n , ( 2 - 2 ) 

n>0 

where Pi <n £ 5 (A*) for all i =1.2 and n > 0, {p n }n>o is a probability distribution, 
and the series is convergent in the weak* topology. A state which is not separable 
is called entangled. 

In the case of matrix algebras, separable states are the ones with density matrix 
of the form 

k 

P = ^2pi Pl,i® P2,i ■ 

*=1 

If p is of the above type, applying the transposition to the first factor we get a new 
positive matrix p Tl . This simple observation (called “Peres’ criterion”, cf. §15.4 of 
[2]) allows to prove that entangled states exist. 


Example 2.1. Let Ai = A- 2 = M2(C), {ej}j = i ; 2 be the canonical basis of C 2 
and eij the matrix with 1 in position ( i,j ) and zero everywhere else. Let peeii be 
the projection in the direction of the unit vector ^j(ei <8> C\ + e 2 ® e 2 ), hence 


PBell = 



Thus, p Beil Is a pure state of the composite system A 


A\ (g) A 2 . Since 


7i 


^ Bell - 2 2^ 


1 


has an eigenvector v := e\ <g> e 2 — e 2 e± with negative eigenvalue (PBeii' 1 ’ = — \ v )> 
it is not a positive matrix, and the state associated to p Beil is entangled. 


The study of entanglement is an extremely active area of research in physics. 
For the quantum mechanical interpretation of marginals as projective measure¬ 
ments, and entanglement in quantum mechanics, one can see [2, §15] or [29, §10.2]. 
The one in Example 2.1 is a composite system of two qubits, and pBeii is a Bell 
states for this system [29, Ex. 3.2.1]. 

Note that any separable pure states is a product state, since a pure state by 
definition cannot be a convex combination of other two or more states. Hence for 
pure states these two notions coincide. On the other hand, pure states can be 
entangled, as in Example 2.1. 

In the case of commutative C*-algebras A\ = C'o(A'i) and A 2 = Cq(X 2 ), from 
the identification A = A\ (g> A 2 ~ Co(X 1 x X 2 ) we deduce that every pure state of A 
(= point of Xi x X 2 ) is a product state. In the unital case, 5(A) is a compact convex 
set, hence by Krein-Milman theorem it is the closed convex hull of its extreme 
points. This and the above observation on pure states proves that every state 
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of a commutative unital C*-algebra A is separable: entanglement is an exclusive 
property of noncommutative C* -algebras/quantum systems. 

Example 2.2. If A = M n ( C), pure states are in bijection with rank 1 projec¬ 
tions, and then with points o/CP” -1 . The map CP m_1 x CP ” -1 —» CP mn_1 into 
product states of a bipartite system is given by the Segre embedding [2, §4.3]. 

Example 2.3. If A = C n , 5(A) is the set of pi'obability distributions on n 
points, which geometrically is the standard (n — 1)-simplex: 

5(A) = A " -1 := { p = {pi, ■ • • ,p n ) e ■■ Pi > 0 A J2, Pi = 1 } • 

The embedding as pj'oduct states A m_1 x A " -1 —> A mn_1 , (p,q) ^ x := (pi.qj), 
has for image the subset of x = ( Xij) satisfying the algebraic equations 

W%jXkl — XuXkj , V i,j,k,l. 

If m = n = 2, we get the quadric surface described in Appendix A. 

2.4. Spectral triples and the spectral distance. If A is a *-algebra of 
bounded operators on a complex Hilbert space, we denote by 5(A) the set of states 
of the norm closure of A, and by A sa the set of selfadjoint elements of A. 

A natural way to construct a metric on 5(A) is by means of a spectral triple. 
Standard textbooks on this subject are [5, 14, 17]. 

Definition 2.4. A spectral triple (A,TL,D) is the datum of a ^-algebra of 
bounded operators on a Hilbert space TL and a (unbounded) selfadjoint operator D 
on TL, such that a(D + i)~ x is compact and [D,a\ is bounded for all a € A. A 
spectral triple is: 

i) unital if 1 b(H) G A (the algebra is unital and its unit is the identity on TL); 

ii) even if there is a grading 7 = 7 *, 7 2 = 1, commuting with A and anticom¬ 
muting with D. 

We call D a (generalized) Dirac operator. 

Although the definition makes sense for real Hilbert spaces as well, and there 
are examples where one is forced to work over the field of real numbers (e.g. in the 
spectral action approach to the internal space of the Standard Model of elementary 
particles, see e.g. [ 6 , 32]), here I will focus on complex algebras and spaces (the 
only exception being the example in §4.5). 

If (A,TL,D) is a spectral triple, an extended metric do on 5(A) is defined by: 

d D (<p,if)= sup {p(a) — if (a) : || [D, a] || < 1} , (2.3) 

for all ip,if £ 5(A). We refer to this as the spectral distance. 

Note that from any odd spectral triple (A,TL,D) we can construct an even 
spectral triple (A, TL (g> C 2 , D < 8 » ay, 1 ® cr 3 ) without changing the spectral distance. 
Here 07 , cr 3 are Pauli matrices, we use the obvious representation a 1 —> a ® 1 of A 
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onW®C 2 , and since [D, a] and [D <g> o \, a (g> 1] = [D, a] (g> o\ have the same norm, 
clearly the distance doesn’t changes. So, we do not loose generality by considering 
only even spectral triples. 


2.5. Spectral distance between normal states. For normal states there is 
a formula for the distance (2.3) which is a little bit more explicit. Let (A. H, D) be 
a spectral triple and two distinct normal states with density matrices pi,p 2 

satisfying p := p\ — p -2 G A. Call 

(°, b) Tr = Tr(a* 6 ) (2.4) 

and let 11 a 11 Tr be the associated norm (whenever they are well-defined, for example 
for a G £ 1 ('H) and b G B{TL) the former, and for a € C 2 {T~L) the latter). Let 

V p := {a = a* £ A : (a, p ) Tr = 0} 

be the subspace of A sa orthogonal to p, and set 

L(p) := inf \\[D,p + a\\\ . 

aeVp 

Proposition 2.5. Either L(p) = 0 and dn{p 1 ,^ 2 ) = 00 , or L(p) 7^ 0 and 

d D (vi,<P2) = i(p) _1 j|p||Tr • (2-5) 


PROOF. Let W := {a G A sa : a £ V p }. We distinguish between two cases: 

i) 3 a 0 G W such that [D,a 0 ] = 0; 

ii) [D, a] 7 ^ 0 for all a G W. 

In the first case, any a = Aao satisfies ||[D,a]|| < 1 and: 


d D {ipi,(p 2 ) > A(p,a 0 ) Tl . VAgR. 


Since {p, ao)^ 7 ^ 0 by definition of W, we get do{p 1 , P 2 ) = 00 . Moreover, called 


b = too — p , t := 


12 

llTr 


{pi ®o) Xr 

one checks that b G V p , and [D,p + b\ = t[D,ao] = 0. Therefore, L(p) = 0. Thus, 
the proposition is proved in case (i). Now we pass to case (ii). 

Since (p, a) Tr = 0 for a G V p , we can write 

, , x (P,a) Tr 

<io(»>l,w) = sup 

where by hypothesis the denominator is not zero. 

Any a G *4 sa , a ^ V p , can be written in a unique way as 

a — A (p + b) , 

with A ^ 0 and b G V p (take A := (p, a) Tl . /HpIIti-’ ^ ;= — Pi and check that 

b G V p ). Thus: 


doiViiV 2 )= sup 


-MIpIItt 


x*oTev p \\\[D,p + b]\\ !^ p \\[D,p + b}\\ ■ 


II2 
II Tr 
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Now the conclusion is obvious: either the sup is infinite, when the inf of the de¬ 
nominator is zero, or L(p) 0 and pf) = 1 IIpIIti - ® 

Remark 2.6. In [31, Eq. (38)], the authors give a formula similar to (2.5), 
with L replaced by ||[Z),p]||. They claim that do(T 1 ,^ 2 ) is equal to: 

M 2 Tr/\\[D,p]\\. 

This statement unfortunately is wrong. A counterexample is in Prop. 3.10 of [ 4 ] : 
we have normal pure states at infinite distance, despite the fact that [D, p] is not 
zero (only scalar multiples of the identity commute with the Dirac operator of [4]). 

Remark 2.7. Let A = M n ( C) be naturally represented on TL = C n © C k , 
with n,k> 1. In this case states and density matrices in A are in bijection, and 
Prop. 2.5 gives the distance between arbitrary states. 

Assume further that [D,a] = 0 only if a is proportional to the identity (“con¬ 
nectedness” condition); note that this forces k > 2 , otherwise D G A commutes with 
a = D. In such a case it is well known that the distance is finite (see e.g. Prop, f.2 
of [ 4 ]), and then given by (2.5) for any pair of states. 

One should stress that, even if (2.5) may look simpler than (2.3), it is not. We 
just traded a sup for an inf in the definition of L(p). 

3. Products of spectral triples 

The product (A,TL,D) of two spectral triples {A\, TLi, Di, 71 ) and {A- 2 ,Tl- 2 , D 2 ), 
is defined as 

A = Ai © A 2 , TL = TL\ © TL 2 , D = D\ © 1 + 71 © D 2 , (3-1) 

where 0 is the algebraic tensor product. 

On the set <S(-4i) x 5(-4 2 ) of product states of A, two extended metrics are 
defined: the spectral distance do and the product distance dojKIdiv The purpose 
of this section is to discuss the relation between these two. 

In the Euclidean space, Pythagoras equality is a criterion that can be used to 
decide whether two intersecting lines are orthogonal or not. This motivates the 
next definition. 

Definition 3.1. If 

d D (p, if) = d Dl IE d D2 (ip, ip) 

we will say that the states p,ip satisfy Pythagoras equality. If such equality holds 
for all pure states, we will say that the product is orthogonal. 

The main result of [11] is that for arbitrary unital spectral triples the distances 
are equivalent (although not necessarily equal), and more precisely: 

Theorem 3.2. For all product states p,ip: 

u) (a) 
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The inequality (ii) is in fact valid also when the spectral triples are non-unital, and 
it is a consequence of the following basic property: taking a product of spectral 
triples doesn’t increase the horizontal resp. vertical distance. That is: 

Lemma 3.3. For a product of arbitrary (not necessarily unital) spectral triples: 

dD(<P,il>i ® ¥> 2 ) < d Dl {(fii,ipi) and d D {ifi ® ^ 2 , if) < d D2 {(fi 2 , 1 ( 2 ) , 

for all (fi = (fi 1 <S> <fi 2 and if = ifi ® 1(2 ■ 

PROOF. For a G yl sa denote a\ := (id ® ip 2 ){a) G -4i a and observe that 

- {if 1 ® ){a) = - ifi{ai) . (3.2) 

From Lemma 9 and Corollary 11 of [ 11 ] (I will not repeat the proof here): 

|| [Dl, Gl] | < ||[Ao]|| ■ 

(There is a typo in Eq. (18) of [ 11 ]: there is no square in the last norm.) 

Thus 

do{(f, tfi ® (Pi) = sup {(fii{ai) - V’i(ai) : ||[-D,a]|| < l} 

< sup {(fii{ai) — ifi{ai) : || [Hi, ai] || < l} 

and this can be majorated by considering all ai G „4| a rather than only those in 
the image of the map „4 sa -A- *4| a above, and we get 

< sup {<fii{ai)-ifi(ai):\\[Di,ai]\\<l} =d Dl {(pi,ifi) . 

CLl £*4.^ a 

This proves the first inequality of Lemma 3.3, the other being similar. ■ 

Proof of Theorem 3.2(h). Now, call a := d£>{<p,if), b := dn{(fi,ifi ® ^ 2 ) 
and c := do{ifi ® (fi 2 , if) the lengths of the edges of the triangle in Fig. 2. From the 
triangle inequality and Cauchy-Schwarz in M 2 : 

a < b + c = ( 1 , 1 ) • {b, c) < \/2\Jbi 2 + c 2 . 

From this and Lemma 3.3, we get the inequality (ii) of Theorem 3.2. ■ 

In the unital case, (i) implies that the inequalities in Lemma 3.3 are in fact 
equalities: so, the product doesn’t change the horizontal resp. vertical distance. In 
the non-unital case, on the other hand, there are simple counterexamples [ 11 , § 6 ]. 

One may wonder if Theorem 3.2 can be generalized to arbitrary states, i.e. if 

d Dl M d D2 {‘fi i ',if l> ) < d D {ifi,if) < V2d Dl M d D2 {(fi i> ,if b ) ? 

for all (fi,if G 5(^4), where (f? := (p\ ® <P 2 and if* 0 := if\ ® if\ are defined using 
marginals. It is easy to convince one-self that the inequality on the right can’t 
always be true: take two different states with the same marginals, fi 7 ^ if but 
(fif = then the product distance is zero, but do{(fi,if) 7 ^ 0 , and we get a 
contradiction. What fails in the proof of Lemma 3.3 is the equality (3.2), which 
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holds only for product states. On the other hand, I will show that the inequality 
on the left is valid for arbitrary states and unital spectral triples. 

3.1. An auxiliary extended semi-metric. From now on, I will assume we 
have a product of two unital spectral triples (counterexamples to Theorem 3.2(i) 
in the non-unital case can be found in [11, §6]). Consider the vector space sum: 

A\ + A 2 := {a = ai ® 1 + 1 ® a 2 | a\ £ Ai, a 2 £ A 2 } , (3.3) 

where we identify A\ with A\ ® 1 and A 2 with 1 ® A 2 . 

An extended semi-metric df, on 5(A) is given by 

d£>(^>V0 = SU P Ma) - V’(a) : II [A a] II < l} • 

ae(Ai+A 2 ) sa 

Since (Ai + A 2 ) sa is a subset of A sa , we have the obvious inequality 

< d D ( ip , if ) , (3-4) 

for all p,if £ 5(A). The next one is Lemma 8 of [11], of which I will give a shorter 
proof. 

Lemma 3.4. For any self adjoint ai £ Ai and 02 £ A2 : 

|| [Di,a\]|| 2 + \\[D 2 ,a 2 ]\\ 2 = || [ D, ai ® 1 + 1 ® a 2 ]|| 2 . (3.5) 

PROOF. Consider the positive operators A := — [Z?i, ai] 2 , B := — [D 2 , a 2 ] 2 , and 
note that 

C := — [D, ai ® 1 + 1 ® a 2 ] 2 = A®l + l(g).B. 

From the triangle inequality, we get ||C|| < ||A|| + ||H||. We want to prove the 
opposite inequality. We achieve this by writing the left hand side in (3.5) as: 

l.h.s. = sup { (iq, Avi) + (v 2 ,Bv 2 ) } = sup (tq ® u 2 , C(vi ® n 2 )) 

V\€l'H\,V2€lH.2 Vi€'Hi 1 V2€'H2 

IviIhMM 11^11=11^11=1 

< sup ( v,Cv) = He'll = r.li.s. , 

vCH: wj-l 

where last inequality comes from considering all unit vectors in B, rather than only 
homogeneous tensors tq <g> i> 2 . ■ 

Proposition 3.5. For any tp,if one has 

= d Dl ^d D2 (ip b ,tjj b ) , 

where, as before, p\ and p\ are the marginals of p and p^ — p\ ® p\ ■ 

PROOF. For any a = ai®l + l®a 2 €: (Ai + A 2 ) sa , 

p{a) - ip(a) = {p\{ai) - ^(aq)} + {p\(a 2 ) - ^ 2 ( 02 )} • 

From (3.5), 

do(<P,’'!’) = sup ( sup {^i(oi) - V’J(oq) : ||[£>,a 1 ]|| < a} 
o:,/5>0 \ a\=a\^A\ 

q 2 +/3 2 = 1 
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+ sup {v\{a 2 ) - V4(a2) : || [D,a 2 ]|| < /?} 

02=^2 £*4 2 

= sup (ad Dl (</>Ui) + /3 ^d 2 (^2)V , 2)) = JdoA^i^i) 2 + d D 2 (^ 2 ^ 2) 2 > 
a,/3>0 ' /v 

a 2 +/3 2 = l 

where last equality comes from Cauchy-Schwarz inequality (cf. [11, Lemma 7] for 
the details). ■ 

As a corollary, from last proposition and (3.4), we get 

d D (<p, ip) > d Dl ^ d D2 (ip' l> , ip b ) ( 3 - 6 ) 

for arbitrary states. Clearly, since the product distance depends only on the 
marginals, for p ^ ip with the same marginals the right hand side of (3.6) is zero, 
while do is not. Whatever the spectral triples are, (3.6) cannot be an equality on 
arbitrary states, but it could be on the set of product states, where d is a proper 
extended metric (or on some smaller set). 

3.2. Product states. In this section we give a criterion (Prop. 3.7) to check 
whether do is the product metric on product states. As an application, I will show 
in §5.2 that in a product of two two-point spaces, Pythagoras equality holds for 
arbitrary product states. 

For any p = p\ (g) p 2 and ip = ip\ (g> ip 2 we define a map P v , 4 , : A —> A by: 

Pep, 4 , ■= <p\ <8 id + id (g) ip\ - p\ ® ip\ . (3.7) 

One can verify that id — P v , 4 , (and then Pp,^,) is an idempotent by looking at the 
identity: 

id - Pp, 4 > = (id - p\) 0 (id - ip\) (3.8) 

and noting that p\,ip\ (and then id — p\ and id — ip\) are idempotents. 

Lemma 3.6. We can decompose A as a direct sum (of vector spaces, not alge¬ 
bras): 

A = range(P Vi ^,) ® ker(P^) , (3.9) 

where range and kernel of the idempotent are: 

range(P /3) ,/,) = A\ + A 2 , keifPp^) = kerpi ® ker ip 2 , 
and A\ + A 2 is the set (3.3). 

PROOF. Clearly rang e(P Vt 4 ,) C Ai + A 2 ■ But a simple check proves that P v> ^ 
is the identity on A\ + A 2 , hence a = P v ^(a) V a £ A\ + A 2 and the opposite 
inclusion A 1 +A 2 C range(P ¥ , i1 / ) ) holds too, proving that the two sets coincide. The 
range of id — p\ is the kernel of p\, and similar for id — ip\. Hence the range of 
id — Pp, 4 >-, which is the kernel of P V)J />, is keryq ^kerip 2 - Any a £ A can be written 
in a unique way as a = Pp i 4 ,(a) + (id — P v ^)(a), proving (3.9). ■ 
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Proposition 3 . 7 . Let p = <p\ ® p>2 and ip = ipi®ip2- If, for all a £ A sa , 

\\[D,P^(a)]\\ < ||[Ao]|| , (3-10) 

then do{<p, ip) = d X D (p,ip) (so, the states <p,ip satisfy Pythagoras equality). 

PROOF. From range(id — P v ,q,) = ker p\ <g» kerip2, id — P v ^ maps A into the 
kernel of both ip and ip. So, for all a £ A: 

<P{P V M) = <p(a) , ip(P v ^(a)) = ip (a) . 

Moreover, b = P v ^(b) for all b £ A\ + A2- Thus: 

do{p,ip)= sup {tp(P^{b)) - ip(P v ^{b)) : \\[D, P v ^(b)}\\ < l} 

bdz(~ 4.i+w42) sa 

= sup {p(P v ^(a)) - ip(P v ^{a)) : \\[D, P^(a)]\\ < l} 

= sup {tp(a)-ip(a) : \\[D, P v ^{a)]\\ < l} . 
a£A. sa 

Now, from (3.10), we deduce that dp(ip, ip) > do{p, ip). But the opposite inequality 
also holds, cf. (3.4), hence the thesis. ■ 

3.3. Normal states. Let ip = p\ (g) p 2 and ip = ipi ® ip 2 be as in previous 
section. In this section, I assume that <p\ and ip 2 are normal states with density 
matrices p\,p2'- 

(a) = Tr Wl (pia) , ip 2 (b) = Tr n 2 (p 2 b) , 

for all a £ A-\ and b £ A 2 . The above formulas allow then to extend p\ and ip2 
to the whole B(fH\) resp. B(fH 2 ), and P v ^ to the whole BfH). Note however that, 
since in general the density matrix of a given state is not unique, the extension is 
also not unique. 

Let K be the norm of the idempotent (3.7): 

A':= sup "1 
6eB(K):M 0 Irl 

Lemma 3.8. 1 < K < 3. 

PROOF. Any b £ rang e(P v ^) gives a lower bound 1. The upper bound comes 
from the norm 1 property of a state together with the triangle inequality. ■ 

Lemma 3.9. Assume p\ commutes with 71 and either: i) P2 commutes with D2 
(it can be, for example, an eigenstate of the Dirac operator); or ii) {A 2 , TL2, D2, 72) 
is also even (although we do not use 72 in the definition of the product spectral 
triple) and P2 commutes with 72. Then: 

<M7i[- D i,a]) = 0, V«£ Ai, (3.11) 


and 


ipi{[D 2 ,h]) = 0 , 


V&e A 2 . 


(3.12) 
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PROOF. If [pi, 71] = 0, since 71 anticommutes with D 1 and commutes with A\, 
then for all a G A\: 

= -<Pi([-Di,abi) = -Tr Wl (p 1 [D 1 ,a] 7 i) 
and from the cyclic property of the trace: 


= —Tr Wl (7ipi[£>i,a]) = -Tr Wl (pi7i[Di,a]) 

= -pi(7i[L>i,a]) , 

This implies (3.11). 

The proof of (3.12) under the hypothesis (ii) is similar. If [^ 2 , 72 ] = 0, then 
ip 2 {[D 2 ,b]) = -02 (72 [^ 2 , 6 ]) = -ip 2 {l 2 {D 2 Al 2 ) 

= -Tr H2 (p 2 72[A>,fr]72) = -Tr W2 (7 2 p272[A>,fr]) 

= -Tr- H2 (p 2 72[- D 2,&]) = -ip 2 ([D 2 ,b]) , 

hence ip 2 ( [D 2 , b ]) = 0 . 

If, on the other hand, (i) is satisfied, that is [£> 2 ,^ 2 ] = 0, then: 

M[D 2 ,b]) = Tr H2 ([D 2 ,p 2 b\) = 0. 

(Since D 2 is unbounded, the cyclic property of the trace doesn’t hold. But one can 
prove the above equality by writing the trace in an eigenbasis of D 2 and p 2 .) ■ 


Proposition 3.10. Let p 1 and ip 2 be normal and satisfying (3.11) and (3.12). 
Then 

df, (<p, ip) < d D (<p, ip) < K df, (<p, ip) . 

If K = 1, then Pythagoras equality holds. 


PROOF. Let 7 := 7 i 0 1 and 

Pcp,tp{b) / yP<p,ii’{'yb') 

for all b € B(TL). For any y = yx, y*y = x*x and then ||y|| = ||x||. Hence 

II D II _ II )|| _ II D II _ /,' 

II-* ~ SU P II i/|| — II-* v’iV’II — , 

beB(n):b^o \n b II 

where V = 7 b. For all a € A: 

[D, P v ^(a)\ = [Di 0 1,id 0 4>l{a)\ + [71 0 D 2 , ip[ 0 id(a)] 

= (id 0 ip\){[D± 0 1, a]) + (71 0 l)(<pf 0 id)([1 0 D 2 , a]) 

= (id 0 %p\)([D\ 0 1, a]) + (<pf 0 id )([71 0 D 2 l a\) 

= P Vt ,/,([£>i 0 1, a]) + P ip ^([71 0 D 2 , a]) 

= P v ^{[D,a\) . 

In last-but-one equality we used (3.11) and (3.12). Thus || [D, P v ^(a)\ || < K\\[D, a]||. 

If we now repeat the proof of Prop. 3.7, we deduce that df,(p,tp) > K~ 1 dj 2 (p,ip), 

hence the thesis. ■ 
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While Prop. 3.7 can be used to prove Pythagoras equality in some examples 
(cf. §5.2), it is not clear if there are examples where K < y/2, and then Prop. 3.10 
allows to improve the bounds in Theorem 3.2. Note that a non-trivial C*-algebra 
projection has always norm K = l 5 . Unfortunately this is not true for idempotent 
endomorpliisms of a normed vector space. For example, if P € M n (C) is the 
matrix with all l’s in the first row and zero everywhere else, then P 2 = P, but 
PP* = diag(n, 0,..., 0) implies that the norm is yfn. 


4. Examples from classical and quantum transport 


In order to understand what the spectral distance looks like, it is useful to 
have in mind some examples. In this section, I collect some examples where the 
distance can be explicitly computed, which include: the canonical spectral triple of 
a finite metric space and of a Riemannian manifold, which are useful to illustrate 
the connection with transport theory, several natural spectral triples for the state 
space of a “qubit” (§4.5), and a digression on the Wasserstein distance between 
quantum states for the Berezin quantization of a homogeneous space. 


4.1. The two-point space. Let us start with the simplest example, A = C 2 . 
We identify A with the subalgebra of M 2 ( C) of diagonal matrices, acting on the 
Hilbert space P = C 2 via matrix multiplication. We get an even spectral triple 
(A,P, D,j) by taking D := ^F, where: 


1 

0 

, F := 

0 

1 

0 

-1 


1 

0 


(4.1) 


Here A > 0 is a fixed length parameter. We can identify tp € [—A, A] with the state: 


a = 


0 

0 a; 




1 + ip/X 

a -[■ 


1 - (fi/X 


(4.2) 


Clearly S(A) ~ [—A, A], and I will use the same symbol to denote a state and the 
corresponding point in [—A, A]. Pure states are given by ip = ±A. 

The proof that dn{p, VO = \<p — V’l is an exercise that I leave to the reader. 


4.2. The standard 2-simplex. After the two-point space, the simplest ex¬ 
ample is the space with three points at equal distance. Let A = C 3 (with com¬ 
ponentwise multiplication), represented on P = A ® A by diagonal multiplication, 
and D be given by D(a(Bb) = £>_(&)© D + (a) where D + is the permutation matrix 


D+ = 


0 0 1 
1 0 0 
0 1 0 


and D _ = (£>+)*. States are in bijection with points of the simplex (Fig. 3a), 


A 2 :={(/? G K 3 : ipi > 0 V i and ipi + ip 2 + <P 3 = 1} , 


5 P = P*P implies ||P|| 2 = ||P*P|| = ||P|j by the C*-identity, hence ||P|| is 0 or 1. 
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93 - 92 93 - 9l 




(a) The standard 2-simplex (b) The triangle T (red), Q( A 2 ) (black), 

and the hexagon E. 


Figure 3 


via the map 

= (¥>i, ¥> 2 , <^ 3 ) ^ ¥>* '■ ^*0) : = (<P, a) , V a G .4, 

where on the right we have the canonical inner product of C 3 . With a slight abuse 
of notation, from now on I will denote by ip (without *) both the vector and the 
corresponding state of A. The vertices ei = ( 1 , 0,0), e 2 = (0, 1 , 0) and e 3 = (0, 0, 1 ) 
are mapped to pure states, and the barycenter 

eo := |(ei + e2 + 63) 

of A 2 to the trace state. For all £ A 2 , the difference <p — ip belongs to the 
plane through the origin (parallel to A 2 and orthogonal to eo) given by: 

II := {u £ R 3 : i>i + v 2 + V 3 = 0} . 


Lemma 4.1. ||[D,a]|| = max,yj |at — aj\ for all a £ A sa . 


PROOF. [D,a\*[D,a] is the direct sum of the operators [D + , a]*[D + , a] and 
[D_,a\*[D_,a] = [D + , a][D + , a]*, which by the C*-identity have the same norm. 
So || [D, a] || = ||[L>+,a]||. From 


{D + ,a}*[D + ,a} 


we get the thesis. 


(ai - a 2 ) 2 0 0 

0 (a 2 - a 3 ) 2 0 

0 0 (01 - a 3 ) 2 


With this, one can prove that djj is the metric do(ei, ej) = 1 V * / j on pure 
states, and the Chebyshev metric on arbitrary states, as I show below. 

With the vector product we define a surjective linear map Q : R 3 -A II, 


Q(a) := a A 3e 0 = (a 2 - a 3 , a 3 — 01 , a 3 — a 2 ) , 
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satisfying the algebraic identity: 

(Q(v), Q{a)) = 3 (v, a) , V v € II, a € R 3 . (4.3) 

The translation a —> a — eo, orthogonal to A 2 , transforms A 2 into a new equilateral 
triangle T with barycenter at the origin O , and with vertices {qt = ei — eo} 3 =1 . 

For a £ A 2 , since Q(a) = Q(a — eo), we can imagine that Q is the composition 
of the translation A 2 —» T and the endomorphism Q |n of the plane II D T. Since 
v € II is orthogonal to eo, Q(v) = v A 3eo is a 90° clockwise rotation composed 
with a dilatation (uniform scaling) by a factor || 3eo ||2 = y/3. The situation is the 
one illustrated in Fig. 3b: the small triangle T is rotated and then scaled until it 
matches the big circumscribed triangle Q(A 2 ), with vertices {Q(e.j)} 3 = i and triple 
area. 6 

Let us come back to the spectral distance. The usefulness of the map Q is that 
||[D,a]|| = ||Q(a)|| 00 , Vae/ a , (4.4) 

cf. Lemma 4.1. 

Lemma 4.2. For all £ A 2 : 

dD(tp,ip) = ^ sup { (Q(ip - ip),w) : |H|oo < 1} • (4.5) 

PROOF. For any a £ „4 sa = R 3 , called v = ip — ip, then p(a) — ip (a) = (v, a) = 
| ( Q(v ), Q(a)). From this and (4.4), we get (4.5) by noticing that the map R 3 —> II, 
«4w = Q{a) is surjective. ■ 

The distance can now be explicitly computed. 

Proposition 4.3. For all <p,ip £ A 2 .- 

d D (p,ip) = \\<p-ip\\oo ■ 

Proof. The set 

E:={t»£ll: IMloo < 1} 

which appears in (4.5) is the hexagon (in the plane II) with vertices {g.j — qj}ijtj, 
cf. Figure 3b. We want to maximize the product 

(Q(p-ip),w) = \\Q{<p - VOII 2 IKH 2 = Vz\\p - iph\\w'\\ 2 , 
where w £ E and w' is the orthogonal projection in the direction of Q(p — ip). 
Clearly the maximum is reached when w is in the boundary of E, and more precisely 
when w is the nearest vertex, cf. Figure 4a. So, the maximum value of ||u/|| is 
V2cosa, and 

d D (p,ip) = \j ^ \\(p-ip\\ 2 cosa . 

6 It’s easy to check that T is inscribed in Q(A 2 ): for example, from the algebraic identity 
q -3 = |Q(e 1 ) + \Q(e 3 ) we see that is a convex combination of two vertices of Q(A 2 ), and then 
lie on the corresponding edge. 
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Figure 4 


The angle between Q(<p — ip) and the nearest vertex of the hexagon E is the same as 
the angle between <p— ip and the nearest vertex of the rotated hexagon ^Q(E). Since 
^Q(E) has vertices {±< 7 j}j = 1 , 2 , 3 ) then (minimal angle means maximum cosine): 


cos a = max 


*=b 2 > \\ip - V’lHMh 


Since ||*|| 2 = \f^ /3, we get 


d D (<p,ip) = max \(ip - ip,ei - e 0 }\ , 

1 = 1 , 2,3 

but (ip - ip, e 0 ) = 0 , so d D (p, ip) = max i= i i2 ,3 |‘Pi - ipi\ = \\ip - ip ||oo 


(4.6) 


From (4.6) we get a nice geometrical interpretation. The distance is obtained 
by projecting the vector <p — ip on the three medians of the simplex, and computing 
the max of the lengths of the three resulting segments. 

To get an idea of the behavior of do, we can compute its value when one state 
is a vertex, and the other is on the opposite edge of the simplex, say e.g. <p = e\ 
and ip = (1 — A)e 2 + Ae 3 for some A € [0,1]. Then doip, ip) = max{ 1, A, 1 — A} = 1 
is independent of A. 

If we draw a picture, the simplex with the spectral distance looks like a Reuleaux 
triangle, cf. Figure 4b. In the figure, ip' is the intersection of the line through tp and 
ip with the circle centered at p and passing through the opposite vertices; dn(p, ip) 
is the Euclidean distance between ip and ip'. 


4.3. Finite metric spaces. Given an arbitrary finite metric space (X,g), 
there is a canonical even spectral triple (A, 'H 1 D : 7 ) associated to it, with the prop¬ 
erty that the spectral distance between pure states coincides with the original metric 
g. This is constructed as follows. 

We take A = C N ~ C(X) and think of the pure states 

V a = (ai,..., a N ) G C N , i = 1,..., N, 


*£i(n) • , 


(4.7) 
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as the A r -points of X. For i,j = 1,..., N, let Hij = C 2 and Tr tl be the representation 

a,- 0 


i(a) = 


0 a. 


A faithful (unital *-)representation of A is given on T-L = ® ^ • Hij by 7 r = ^ij ■ 

I will identify A with tt(A) and omit the symbol n. Finally, we define D = ©^. Dij 
and 7 = ©^ ■ 7 ij, where D^ and 7 ij are the operators on Hif 


1 

' 0 

1 


1 

0 " 

9ij 

1 

0 

i lij ~ 

0 

-1 


and I use the shorthand notation < 7 ^ := g(xi, Xj) (note that gij 7 0 V i ^ j). 

It is not difficult to prove that the spectral distance between pure states is the 
metric we started from. 

Proposition 4.4. do{x k , xi) = gu for all k,l = 1,..., N. 

PROOF. Since [D,a] = [- 0 ^, 7177 ( 0 )], clearly 


II [D, a] || = max || [D ^, 717 ,- (a)] || = max 

*7 h * 7=7 


9ij 


(4.8) 


and then 


d D (x k ,Xi) = sup {a k - ai : ||[H,a]|| < 1} < g k i . 

oSR” 


The inequality is saturated by the element a = ( 51 ;, g 2 i, ■ ■ ■ , gm) € R w , which 
satisfies 


ai — dj 


9ij 


< |ai| + \dj\ = g u + gji < 1 
9ij 9ij 


due to the triangle inequality — so || [D, a] || < 1 - and 

a k — ai = g k i — gu = g k i . 


This is basically the construction of [16, §9] (see also [32, §2.2]). This spectral 
triple is the discrete analogue of the Hodge spectral triple discussed in §5.1. 

Note that the construction, as well as Prop. 4.4, remains valid if g is an extended 
metric (with the convention that in the definition of D , D^ = 0 if g-i-j = 00 ), but 
doesn’t extend to semi-metrics (we need g-ij 7 0 V i 7 j in order to define D). Since 
do is an extended metric, it is not surprising that g must be an extended metric in 
order for the proof of Prop. 4.4 to work. 

It is easy to give a transport theory interpretation to do{g>, VO i n the finite 
case. A general state has the form ip = YhiViXii where l is a probability 

distributions on X. 

As usual, suppose <p describes the distribution of some material in X 1 which 
we want to move to resemble another distribution (/>. A transport plan will be then 
described by a matrix P = (p^) S M N (M.) with non-negative entries: Pij > 0. The 
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quantity pij tells us the fraction of material we move from Xi to Xj (with pa the 
fraction which remains in the site i). which means 

T, Pik = 1 \/i = l,...,N. (4.9) 

z ' k 

A matrix P satisfying the above conditions is called stochastic. 

After the movement, the total material we want at the site Xj is ipj. Since 
at the beginning the amount was ipj, the difference between what we move in and 
what we move out from Xj must be tpj — <pj. The transport plan must then satisfy 
the condition 

(in)-(out ) = y k ‘PkP k j-'y k P j Pj k =tp j -p j V j = (4.10) 

which due to (4.9) is equivalent to: 

y, k VkPkj =ipj V j = 1,..., N. (4.11) 

If Cij is the unit cost to move from aq to Xj, the minimum cost for a transport 
ip —> ip will be: 

ip) := inf V' (4.12) 

where the inf is over all transport plans P from ip to ip. 

The set of transport plans p ^ ip is a subset of [0,1]^" C R^" defined by 
the system of linear equations (4.9) and (4.11). It is the intersection of an affine 
subspace of R^” with a hypercube: so it’s closed and bounded, hence compact 
(by Heine-Borel theorem). Since P K > JA . PiCijPij is a linear (hence continuous) 
function on a compact space, by Weierstrass theorem the infimum in (4.12) is 
actually a minimum (see e.g. Appendix E of [15]), thus justifying the terminology. 

Next proposition is the discrete (finite) version of Kantorovich duality, which 
can be easily derived from Exercise 1.7 of [35]. Next lemma can be found in the 
appendix of [13], and we repeat the proof here for the reader’s ease. 

Lemma 4.5. For all p,ip: 

K(ip,ip) = sup { tp(a) + ip(b) : a,i + bj < Cij V i, j} . (4.13) 

a,be R N 

PROOF. Since every finite-dimensional real inner product space is isometric to 
R" for some n, we can restate the duality relation in [35, Ex. 1.7] as follows. Let 
V, W be two finite-dimensional real inner product spaces. Then for any v £ V, 
w € W and / : V —> W we have 

sup { {v, u) v : w- f(u) h 0} = inf { (w,p) w : p h 0 and f T (p) = w} , (4.14) 

pGW 

where y 0 means that all components of the vector are non-negative, and the 
transpose f T of / is the map f T : W —> V defined in the usual way: 

(u, f T {h)) v = ( f{u ), h) w , \/u£V,h£W. 
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Now we apply this to V = R 2JV with canonical inner product, and W = Mjv(R) 
with weighted Hilbert-Schmidt inner product: 


Take 


(e, h) w = . . ip* eij h i:i , V e, ft € W. 


v = , w = (gij) , 


and / the function 


f(u)ij =a,i+bj , V« = 6jv) G V". 

The transpose is 

f (^)fc — Pk ^ ' ftfcj ? (k)k-\-N — ^ ' Piblik , 

for all k = 1,..., iV and ft € W. 

The condition f T {p) = u is equivalent to (4.11) plus the condition Y^kPik = 1 
for all i such that ^ / 0. On the other hand if </?j 0 = 0 for some *o, the row i 0 
of P does not contribute to (g,p) w , and we can always replace the row ?'o of P by 
an arbitrary probability distribution and find a new P satisfying (4.9) too. The 
right hand side of (4.14) is then equal to K(ip , ip ). Computing the left hand side of 
(4.14) one then gets (4.13). ■ 

Proposition 4.6. If the unit cost is Cij = g ^, then d , D {< p , ip ) = K ( ip , ip ) V < p , ip . 

PROOF. One can follow the proof of Theorem 1.14 in [35] (there is basically 
no simplification in the finite case), and prove that K (< p , ip ) is the supremum of 
< p ( a ) — ip(a) over a £ A satisfying [a* — Oj| < gij, which by (4.8) is equivalent to 
II [D, a]|| < 1. ■ 


An analogue of the Wasserstein distance for quantum states is discussed in §7.7 
of [2]. The advantage of a distance which can be defined both as an inf and a sup, 
is that from the former definition one can get upper bounds, and from the latter 
one gets lower bounds. With some luck, these will coincide and allow to explicitly 
compute the distance. What is missing for the spectral distance is a formulation 
as an infimum. 

In [37], the authors define a “Monge distance” between quantum states as the 
Wasserstein distance between the corresponding Husimi distributions of quantum 
optics. They also rewrite such a distance as a sup, dual to a seminorm on an 
operator space in the spirit of Rieffel’s quantum metric spaces [25, 27] (cf. Prop. 4 
of [37]). It is not clear however if such a seminorm comes from a Dirac operator. 

Following the approach of [37], I will explain in §4.6 how to define a “Wasser¬ 
stein” distance between quantum states using the symbol map of Berezin quanti¬ 
zation. 
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4.4. Riemannian manifolds. Let M be a finite-dimensional oriented Rie- 
mannian spin manifold with no boundary. A canonical even spectral triple can be 
defined as follows 

A = Cq°(M), n = n'(M), D = d + d*, (4.15) 

with H the Hilbert space of square integrable differential forms and D the Hodge- 
Dirac operator (self-adjoint on a suitable domain). The grading yw := (—1 ) k u> on 
fc-forms is extended by linearity to H. I will refer to (4.15) as the Hodge spectral 
triple of M. This spectral triple is even, even if M is odd-dinrensional, and is unital 
iff M is compact. 

It is well-known that, on pure states/points of M, the spectral distance coin¬ 
cides with the geodesic distance (with the convention that the geodesic distance 
between points in different connected components, if any, is infinite). 

If M is complete, the spectral distance between two arbitrary states coincides 
with the Wasserstein distance of order 1 between the associated probability distri¬ 
butions (see e.g. [10]). To explain the difficulty in computing such a distance (and 
then, more generally, the spectral distance), it is worth mentioning that the only 
case where the problem is completely solved is the real line [24, §3.1]. 

4.5. A simple noncommutative example: the Bloch sphere. The Bloch 
sphere is a geometrical realization of the space of pure states of a two-level quantum 
mechanical system (or qubit), cf. §2.5.2 and §10.1.3 of [29]. Among the several 
metrics used in quantum information, it is worth mentioning the Bures metric, 
which for a qubit can be explicitly computed and is given by Eq. (9.48) of [2]. Here 
I will discuss some natural metrics coming from spectral triples. 

So, let A = M 2 (C) be our algebra of “observables”. We cannot use the ob¬ 
vious representation on C 2 , since there is not enough room for a Dirac operator 
giving a finite distance [16]. Take H = M 2 (C) with Hilbert-Schmidt inner product 
(a, 6 ) hs = \ Tr(a* 6). 7 The basis of Pauli matrices: 


0 

1 


1 

7 

0 

1_ 


1 

0 



, CT 2 = 


, 03 = 



1 

0 


i 0 


0 

-1 


is orthonormal for such a product. The algebra A is represented on H by left 
multiplication. 

Every state on A is normal, with density matrices in bijection with points of 
the closed unit ball B 3 = {x £ R 3 : ||a:|| 2 < 1} in R 3 . Indeed, any selfadjoint matrix 
with trace 1 has the form 

Px = |(1 + X)i=l x i a i) > ( 4 - 16 ) 

where x = (x\,X 2 ,x 3 ) € R 3 . Since its two eigenvalues are ± j|a:|| 2)5 it is clearly 
positive iff ||ar ||2 < 1. It is a projection, hence a pure state, if ||ar ||2 = 1- The map 

B 3 -> S(A) , x^ Tr (p x .) , 

7 Note the different normalization between here and in §4.6, and (,) Tr in (2.4). 
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is then the bijection we were looking for (in fact, a homeomorphism with respect 
to the weak* topology). We will identify <S(A) and B 3 . 

We can retrieve the Euclidean distance by considering the Dirac operator 

D(a) := a* , Va£«. 

We can think of (A,T~L,D) as a spectral triple over the real numbers. As a real 
Hilbert space, H has inner product (a, b) R = 3?(a, b) Hg , Pauli matrices are still 
orthonormal, the representation of A is still a ^-representation , and D is selfadjoint 
in the sense of M-linear operators: (a, D(b)) R = (D(a),b) R V a,b GH. 


II [D, a] || 2 = sup 


Proposition 4.7. For all x, y G B 3 , do{x, y) = ||ar — 3 /II 2 - 

PROOF. Let a = ool + with a 3 G K. and v = (vi,V 2 ,v 3 ) G K 3 and 

note that Tr{(/gj; — p y )a } = (x — y,v) is the canonical inner product on M 3 , and 
do(x,y) is its sup under the condition ||[D,a]|| < 1. Since D is an isometry, [D,a\ 
and D[D,a] have the same norm. But D[D,a]b = [a, b] and 

([a, 6], [a,b]) R 

M0 (^) &)r 

Writing a = aol + v i (J i as above and b = bgl + w i a i, b 3 € C, w G C 3 : 

urn m2 II^ Au; I|2 

P.A = S UP 1, | 2 , 11 in • 

MO N 2 + IMI2 

The maximum is reached when 60 = 0 and w is orthogonal to v, and we get 
|| [D, a] || = ||u|| 2 ■ Therefore 

do{x,y) = sup {(x-y,v) : ||u|| 2 < l} , 

»ei 3 

which is equal to ||— 2 /||2 by Cauchy-Schwarz inequality. ■ 


We can get the same metric from a complex-linear Dirac operator, if we use a 
degenerate representation of the algebra. Let A as above, with representation on 
H = C 2 ® C 2 given by w(a) := a ® 0 (so, acting by matrix multiplication on the 
first summand, and trivially on the second), and let D(v ®w)=w(&v be the flip. 


Proposition 4.8. For all x , y G B 3 , d ^( x , y) = ||x — 2 /II 2 - 

PROOF. Since D[D , 7r(a)] = a ® —a V a G A, ||[£>, a]|| is the matrix norm of a. 
Writing as before a = aol + X), Viai, with a 3 G K. and v = (v 3 ,V2, v 3 ) G R 3 , we find 
HI = |a-o| + Ijnll 2 ■ By Cauchy-Schwarz inequality, (x — y,v) < ||a: — 2/||2 1 |"^||2, and 
the inequality is saturated if a 3 = 0 and v = (x ~ y) /\\x — y\\2- ■ 

Of the two spectral triples above, one was over the field of real numbers, and one 
was not unital (7r(l) is not the identity on %). Another way to get the Euclidean dis¬ 
tance, from a proper unital spectral triple (over C), is via the the SU (2)-equivariant 
Dirac operator of the fuzzy sphere [9, Prop. 4.3]. 

Still another unital spectral triple on M 2 (C) is the one obtained as a truncation 
of Moyal spectral triple [4, §4.3], which I briefly describe in the following. 
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Let A = M 2 ( C) and FL = C 2 ® C 2 as before, but with representation 7?(a) := 
a ® a, and let D be given by D(v © w ) = D_(w) © D + (v), where 


D+ = 


0 1 
0 0 


and D _ = (D + )*. In this case the computation of dg is a bit more involved. 


Proposition 4.9 (Prop. 4.4 of [4]). For all x,y £ B 3 , with x ^ y, one has 
dg(x,y) = c(9)\\x — y\\ 2 , where 6 £ [0,7r] is the polar angle of x — y and 


c(0) 


sind if f<0<f, 

|2 cos ^l” 1 otherwise. 


PROOF. As in the proof of Lemma 4.1, ||[D,a]|| = ||[D + ,a]|| for all a £ A sa . 
For a = Oq! + JT v i a ii with oq £ i and v = (v\,v 2 , v 3 ) £ M 3 , 


||[D + ,a]|MMl2 + k|. 


Due to a rotational symmetry on the horizontal plane, we see that dj^(x,y) is the 
sup of 

(x — y,v) = rt{ cos 8 cos </> + sin 8 sin cf) 


where r = ||ar — y\\ 2 , t = ||f|| 2 , <t> is the polar angle of v and 8 the one of x — y. The 
norm constraint is <(1 + | cos</>|) < 1. The factor c{ 8 ) in the proposition is then 
given by 


c(8) 


I cos 81 cos 6 + sin 8 sin (j> 
SU P -Tb- T - 

O<0<tt/2 1 + COS 0 


In the range [0, |], the derivative vanishes only if tan f = |tan$|, i.e. i) 0 = 2 8 
if 8 £ [0, |], ii) 0 = 2(7 t — 8 ) if 8 € [^f ,7r], and iii) it is always non-negative if 
J < 8 < In case (i) and (ii), the stationary point is a maximum, and gives 
c( 8 ) = 12cos01~ 1 . In case (iii), the sup is for 0 = n/2, and we get c( 6 ) = sin0. ■ 


4.6. Wasserstein distance between quantum states. We can define a 
Wasserstein metric on quantum states by means of Berezin quantization. 

Let G be a compact group and U : G -* A := B(FL) a finite-dimensional 
unitary irreducible representation of G. The adjoint action a on A is defined by 
ci x (a) = U x aUf. I will denote by (a, b) HS = Tr(a*6) the Hilbert-Schmidt inner 

product on A, with N := dimc(?t), and by ||(i||hs = (a, a) H g the corresponding 
norm. Let P £ B(fH) be a density matrix, H C G the stabilizer of P: 

H = {x £ G : a x (P) = P} , 

and M = G/H the quotient space. I will denote by dy the G- invariant measure on 
M , normalized to 1, and by 

{f,g ) L ^ : = [ f{x)*g{x)dy x 

Jm 
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the inner product on L 2 (M , d/i), where functions on M are identified with right H- 
invariant functions on G. We can define two linear maps, a symbol map A —> C(M), 
a K> a a , and a quantization map C(M) -A A, f ha Q j, as follows: 

cr a (x) := Tr(a x (P)a) , Q f := N [ a x (P)f(x)d/i x 

J M 

for all a £ A and / € C(M). They are one the adjoint of the other, that is 

{<r a ,f) L 2 ={a,Qf) HS , V a € A, f € C(M), 

as one can easily check. Since a x (P) is a density matrix, the map er is unital, 
positive and norm non-increasing. With a little work one can prove that the same 
three properties hold for Q (see e.g. [26]). In particular, the operator 

N f a x (P)dy x (4.17) 

J M 

is a-invariant, i.e. commutes with U y for all y € G. Since U is irreducible, from 
Schur’s lemma (4.17) is proportional to 1a- The proportionality constant can be 
computed by taking the trace of (4.17), thus proving that Q maps 1 to 1a- 

Due to the above properties, l- Q sends probability measures into density 
matrices, and Na sends density matrices into probability distributions 8 . The 
latter map is injective under the following assumptions: suppose G is a con¬ 
nected compact semisimple Lie group and P a rank-one projection (a pure state) 
which has the highest weight vector of the representation in its range. Then 
Span{a a; (P) : x € G} = B{T~L) [26, Thm. 3.1], the symbol map is injective 9 and the 
quantization map - being its adjoint - is surjective. 

Now, with a surjective symbol map we can give the following definition. Note 
that on M there is a unique G-invariant Riemannian metric, with normalization 
fixed by the condition that the associated volume form is dy. 

Definition 4.10. We call cost-distance W(p,r) between two density matrices 
the Wasserstein distance of order 1 between the two probability distributions Na p 
and Na T , with cost given by the geodesic distance on M. 

Note that the map p i-a Na p coincides with the map 5(A) —> S(C(M)) obtained 
by pulling back states with the quantization map that we considered in §6 of [7], 
This follows from the identity (Na p ,f) L2 = Tr (pQf). 

That W is finite follows from the next proposition. 

Proposition 4.11. W(p,r) < A7 3 / 2 G||p — t||hs f or a U density matrices p and 
t, where 

£ .— / dg eo (e, x'jdpx 
Jm 

is the mean value of the geodesic distance. 

®In particular, f M Na p {x)dp x = N [a p , 1)^2 = N (p, 1) HS = Tr(p) = 1 for any p. 

9 cr a (x ) = {Na x (P),a,) HS = 0Vi implies (6, a) Hg = 0V6 S BfH), and then a = 0. 
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PROOF. W(p,t) is the sup over all 1-Lipschitz functions / of: 

N {<?p{x) - a T (x)}f(x)&n x = N 2 l (a x (P), p — t) hs f(x)dfj, x 
JM JM 

= N 2 [ ( a x (P), p - r) HS {/(:r) - /(e)} dp x . 

J M 

Here e £ M is the class of the unit element of G, and we used the fact that 
N f M a x (P)dju x = 1, and (l,p — r) HS = N _1 (Tr p — Trr) = 0. Note also that 
11 ccr(-P) 11 hs = since a x (P) is arank 1 projection. Hence from Cauchy-Schwarz 

inequality and the 1-Lipschitz condition: 

W(p,t)<N 3 / 2 \\ P -t\\ hs [ d geo (e,x)dp x . u 

J M 

Let p x := a x (P) be the density matrix associated to a coherent state, depending 
on the class x £ M. The map x <—> p x is a homeomorphism from M to the set of 
coherent states with weak* topology. 

Proposition 4.12. If G is abelian, then W(p x ,p y ) < d geo (x,y) V x,y £ M. 

PROOF. W(p x ,p y ) is the sup over all 1-Lipschitz functions / of: 

W f (x, y) := N 2 [ ( a z (P), p x - p. y ) HS f(z)dp z 
J M 

= N 2 f ((a x -i z -a y -i z )P,P) f(z)dp z 

JM 

= N 2 f (a z {P),P) HS {f{xz) - f(yz)}dp z . 

JM 

Writing P = £ (£, . with £ G H a unit vector in the range of P, we deduce that 

(a z (P),P) m = \(U z tO\ 2 

is non-negative. Also 

N 2 f (a z {P),P ) HS dp z = N (1 A , P) Hg = Tr(P) = 1 . 

JM 

From the 1-Lipschitz condition we get: 

W f {x,y) <N 2 (a z (P), P) m d geo (xz,yz)dp z . 

JM 

Here is where we need the hypothesis that G is abelian: the geodesic distance is 
left G-invariant, so if G is abelian d geo {xz,yz) = d geo (x, y) and we get: 

W f (x,y) — dgeo (x,y) ■ N / (a z (P), P) H g d/i z — d geo (x, y) . 

Jm ■ 


Example 4.13. If G = SU( 2) and H = C 2 is the defining representation, one 
can explicitly compute the cost-distance W between arbitrary density matrices: with 
the identification S(M- 2 (C)) ~ P 3 , one finds thatW is proportional to the Euclidean 
distance on the unit ball. 



PYTHAGORAS THEOREM IN NONCOMMUTATIVE GEOMETRY 


27 


If G = SU(N ) and H = C N is the defining representation, the set of coherent 
states and the set of pure states coincide (both are isomorphic to M = CP"^ 1 ,). 

Since Q : C(M) —>• a! is surjective, we can define a quotient seminorm L on A 
as: 

L(a) := / e c(M) { |l/l,Lip : Q f =a )’ 

where ||/1|Lip = sup,,.^ |/(a;) — f(y)\/dgeo( x ,y) is the Lipschitz seminorm. Since 
(<t p , f) L2 = ( p , Q/) hs for any density matrix p, we get 

W(p,r)= sup { (p~ r,a) m : L{a) < 1} . 

aeA sa 

It would be nice to prove that the seminorm L comes from a spectral triple. A pos¬ 
sible route to spectral triples is by extending the quantization map from functions 
on M to spinors. The case G = SU{2) is discussed in [9] and in §6.4 of [7]. We can 
interpret the first inequality in Prop. 6.16 of [7] as follows: the spectral distance 
associated to the natural Dirac operator bounds the cost-distance from above. 

5. Pythagoras equality for commutative spectral triples 

In this section, I collect some (commutative) examples of products for which 
Pythagoras equality holds. Further examples, including a nonconnnutative one 
(Moyal plane), are briefly discussed in the next section (cf. Cor. 6.11). 

5.1. Pythagoras for a product of Riemannian manifolds. Given the 
Hodge spectral triples of two manifolds M\ and M 2 , we can define two spectral 
triples on M = M\ x M 2 . One as product of the spectral triples of the two factors, 
and one as the Hodge spectral triple associated to the product Riemannian metric 
on M cf. (1.1). These two give the same distance, cf. §3.2 of [11]. 

Verifying Pythagoras equality for the product of the Hodge spectral triples 
of two Riemannian manifold is then reduced to the problem of proving that the 
product metric on M in the sense of Riemannian manifold, i.e. (1-1), induces the 
product distance in the sense of metric spaces, cf. §2.1. This can be proved as 
follows: given two points x = (aq, x 2 ), y = ( 2 / 1 , 1 / 2 ) G Mi let 1 1 —> z(t) = ( Z\(t ), 22 (t)) 
be a geodesic between x and y, parametrized by its proper length (it is enough to 
give the proof when x,y are in the same connected component). One can prove 
that 1 1 ^ Zi(t ) is a geodesic in Mi between a:i and y\, and similarly t eq 22 (f) is a 
geodesic in M 2 between x 2 and y 2 , cf. §3.1 of [11], and that t is an affine parameter 
(not necessarily the proper length) for both curves. Integrating the line element 
one then proves that: 

do{x,y) 2 = do^Xi^i) 2 + d D2 {x 2l y 2 ) 2 , 

where dD { is the spectral/geodesic distance on M,, and do is the one of the Carte¬ 
sian product. So, the product of Hodge spectral triples is orthogonal in the sense 
of Def. 3.1. 
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In the proof, it is crucial the use of geodesics. In §5.3, I will show how to give a 
completely algebraic proof of Pythagoras equality for arbitrary finite metric spaces. 

Let me also stress that, already in the example of Riemannian manifolds, 
Pythagoras equality doesn’t hold for arbitrary states. In §3.3 of [11] we discuss 
a simple example where the ratio do / do 1 ^dD 2 assumes all possible values between 
1 and \/ 2 . 


5.2. Product of two-point spaces. The simplest possible example is the 
product of two copies of the two-point space spectral triple discussed in §4.1. Let 
then (Al,'Hi,.Di, 7 i) = (C 2 ,C 2 ,P, 7 ) and (A 2 , W 2 , D 2 ) = (C 2 ,C 2 ,F). To simplify 
the discussion, we fixed to 1/2 the parameter A in §4.1. We now prove that: 


Proposition 5 . 1 . For all product states ip,if and all a € > 4 sa , the condition 
(3.10) is satisfied. So: Pythagoras equality holds for arbitrary product states. 


PROOF. Product states = p>\ ® p 2 are in bijection with pairs (<pi,<p 2 ) G 
[—1,1] x [—1,1], cf. (4.2). Given two product states ip,ip , ker {P v ^) is spanned by 
the vector: 




<£>1 +1 0 

0 <£>i-l 


1/2 + 1 

0 


0 

1p2 - 1 


Gyp 1 


We can decompose any a G - 4 sa as 


, 24 12, £3 

a = x 0 l®l + — 7®l+yl®7+— v Vi ^ 


Xo,... ,x 3 G 


where the first three terms span _4i +A 2 : they are linearly independent and Ai +A 2 
has dimension dim .4 — dimker(P v ^) = 3. A simple computation gives 


- (7 ® l)[D,a] = F ® {(x 1 + £ 31 / 2)7 + £ 3 } + {(x 2 + x 3 ipi) + £ 37 } <g> F , (5.1) 


were in the computation we noticed that 


V v tf = (<£>1 1 + 7) ® {i>2 1 + 7) • 

With the isomorphism of unital ^-algebras M 2 ( C) ® M 2 (C) —> (the right 

factor inserted in 2 x 2 blocks) we transform (5.1) into the matrix: 


L ■.= 


0 (xi + x 3 <pi) + x 3 x 3 + (xi + £ 31 / 2 ) 0 

(x 2 + x 3 tpi) + x 3 0 0 x 3 — (xi + £ 31 / 2 ) 

x 3 + (xi + x 3 fi> 2 ) 0 0 (x 2 + x 3 ipi) - x 3 

0 x 3 - (xi + x 3 fi> 2 ) (x 2 + x 3 ipi) - x 3 0 


The eigenvalues of L 2 can be computed by first writing the characteristic polynomial 
and then solving a degree 2 equation (or call a = x 1 + £31/2, b = x 2 + x 3 p>\, c = x 3 
and compute the norm as a function of a, 6, c with Mathematica®). In this way we 
get ||L|| = || [D, a]||, that, as a function of £ 1 , 2 : 2 , £ 3 , is given by: 


f ( x ) ■■= V2 | £31 + sj (£1 + x 3 i/j 2 ) 2 + (£ 2 + X 3 lfii) 2 


(5.2) 
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The norm || [D, P ¥ , ) ^(a)]|| is obtained from (5.2) the substitution x 3 —> 0. From the 
triangle inequality |a + 6 |>|a| — | 6 |, 

f(x) > V2 |ar 3 | + \/{\xx\ - |a ; 3 |) 2 + (|a; 2 | - Ml) 2 . 

Called t := I 1 C 3 1, one can check that df jdt is non-negative, so / is an increasing 
function of |a? 3 1. Thus f(x) > f | X 3 =o, which is what we wanted to prove. ■ 

Every state of C 2 is normal. We can use this simple example to show that the 
upper bound in Prop. 3.10 is not optimal. Next proposition is independent of the 
choice of density matrices, i.e. on how we extend states of C 2 to M 2 (C). 

Proposition 5.2. For </j-i = ip 2 = 1, K = \\P V ^\\ = 3. 

PROOF. Due to Lemma 3.8, it is enough to prove that I\ > 3. For a = 7 (g> 7 : 

P<p,1>{a) = 1(8)1-78)1-18)7. 

Since ||a|| = 1, one has K > ||P ¥ , ) ^,(a)||. For v = (0,1)* 8 ) (0,1)*, P Vt y,{a)v = 3v. 
Hence \\P v ^{a)\\ >3. ■ 

Remark 5.3. Prop. 5.1 remains valid if we rescale the Dirac operators by 
arbitrary scale factors, say D\ = A F and D 2 = p.F, with X, p, > 0. In this case 
(5.2) is replaced by 

f(x) := y/X 2 + p 2 |x 3 | + \/X 2 (x 1 +x 3 ip 2 ) 2 F p 2 {x 2 + x 3 <pi) 2 

> ^TJP\x 3 \ + ^A 2 (|^|-|a)3|) 2 +M 2 (l^|-k3|) 2 , 

(by the triangle inequality) and with a derivation with respect to t = |ai 31 we prove 
that f is a non-decreasing function oft, so f{x) > /| 3 , 3=0 = || [D, P v ,i/>(a)]|| • 


5.3. Product of finite metric spaces. Here we consider two arbitrary finite 
metric spaces X\ and X 2 , with N\ resp. N 2 points, and the product of the corre¬ 
sponding canonical spectral triples introduced in §4.3. We adopt the notations of 
§4.3, and distinguish the two spectral triples by a sub/super-script 1, 2. 

Their product is a direct sum, over all i 7 ^ j and k l, of the spectral triples: 


(A 8) A 2 , njj 8) , 7 T ijk i := wjj 8) n 2 kl , D ijU := 8) 1 + 7 lj ® P>li 

which in turn is the product of the triples 

Mi, P\j = , n ij , D ij 1 llj ) . M 2 , Uh = C 2 , nh, Dh) , 

In this example, Pythagoras equality is satisfied by arbitrary pure states. 


(5.3) 

(5.4) 


Theorem 5.4. Given two arbitrary finite metric spaces, the product of their 
canonical spectral triples is orthogonal in the sense of Def. 3.1. 

PROOF. With the pure states (4.7) one can construct morphisms: 

x* 8) x) : C Ns -» C 2 , (i4 (oj, aj) , 
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where s = 1,2. Each triple (5.4) is the pullback of the canonical spectral triple on 
C 2 in §4.1 (possibly with different normalizations of the Dirac operator). Assume 
that p and ip are pure: 

p = xl (g) Xp , ip = x\ (g> x 2 q , 

with r,p,s,q fixed. Since (5.3) is a direct sum, 

II [A a] II = max \\[D ljkl ,n ijk i(a)}\\ > || [D rspq , 7r rsp9 (a)]|| 

' L r z Ji’ i r z ^ 

and 

d D (ip,ip) < d Drspq {p,ip) . (5.5) 

In this way, we reduce the problem to a product of two-point spaces. As shown in 
§5.2, for a product of two-point spaces 

dD rspq (<P, ip) = yj ( gl s ) 2 + ( 9 p q ) 2 = \/d Dl (<p 1 ,ip 1 ) 2 + d D 2 (p 2 ,ip 2 ) 2 ■ 

This proves dn{p,ip) < do, S3 dD 2 (<P, ip), the opposite inequality being always true, 
cf. (3.6). ■ 


6. Pythagoras from generalized geodesics 

In this section, I will discuss a class of states generalizing pure states of a 
complete Riemannian manifold and translated states on Moyal plane. For such 
states, I will then prove Pythagoras equality for the product of an arbitrary spectral 
triple with the two-point space, cf. Prop. 6.9, generalizing the results in [23, 22]. 
I will then show how to extend the result from the two-point space to an arbitrary 
finite metric space, cf. Prop. 6.10. 

6.1. Geodesic pairs. Let a : R -A Aut(A) be a strongly continuous one- 
parameter group of ^-automorphisms of a C*-algebra A. Let V{8) be the set of all 
a € A for which the norm limit: 

J(«) := lim 
w t^o t 

exists. Then V{8) is a dense *-subalgebra of A and S is a (unbounded) closed 
♦-derivation with domain T>(S) [30, §3]. Conversely, one can give sufficient condi¬ 
tions for 6 to generate an action of K on A [30, §3.4]. 

Let A C 'D(S) C A be a *-subalgebra (not necessarily dense). To any state r 0 
on A we can associate a family of states: 

T t {a) := T 0 (a t (a)) . (6.1) 

What allows to prove Pythagoras theorem in the examples in [23, 22] is that the 
curve t *-> Tt in state space is parametrized by the arc length (or has “unit speed”). 

Definition 6.1. Let (A,T~L,D, 7 ) be a spectral triple. A curve t, 1 —> ip t in state 
space is called a metric straight line ifdjj(ip t , <Ps) = — s| for all t,seR [ 12 , §6.1]. 

Let I C R be an interval containing 0. In the notations above, (<r| /, To) will be called 
a geodesic pair if du(T t ,T s ) = \t — s\ V t, s € /. 
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Example 6.2. Consider the Hodge-Dirac spectral triple of a complete oriented 
Riemannian manifold (M,g), or the natural spectral triple of a complete Riemann- 
ian spin manifold. In both cases, one easily checks that (see e.g. [ 10 ] for the latter 
example): 

[DJ]*[DJ}= g ^dJd v f, V/eC 0 °°(M,M), (6.2) 

where we use Einstein’s convention of summing over repeated indexes. 

By the Hopf-Rinow theorem, any two points are connected by a geodesic of 
minimal length, and every geodesic can be extended indefinitely (but it is only locally 
a metric straight line, hence the need of the interval I in the definition above). Let 

c : R -9 M 

be any geodesic and I 9 {0} a closed interval where c is of minimal length. If 
the curve is parametrized by the proper length, |c(£)| = 1 V t. Since c(I) C M 
is a properly embedded submanifold, the vector field c along c can be extended to 
a globally defined vector field V on M (cf. Prop. 5.5 and exercise 8-15 of [19],). 
Clearly | | = |c(£)| = 1 if x = c(t) £ c(I). We can choose V such that |14| < 1 for 

all x £ M (use a parallel frame to define such a V in a neighborhood of c(I), and 
a bump function to extend it globally as the zero vector field outside c(I)). 

Thinking ofV as a derivation on f £ Cg°(M), we define: 

■■= e tv f(x) . 

In the notations above, 6(f) = Vf and by Cauchy-Schwarz inequality: 

\6(f)(x)\ 2 < ||a M /(*)^|| 2 • \\Vxh < \\dj(xn = (g^djdvf)(x) , (6.3) 

for all f £ Cq°(M,R) and x £ M. We will need (6.2) and (6.3) later on. 

If tq is the pure state To(/) = /(c(0)), clearly Tt(f) = f(c(t)) V t £ I, and 

d D (T t ,T 8 ) = dgeo(c(t),c(s)) = \t - s\ , 

for all t,s £ I. Hence (<r|/, To) is a geodesic pair. 

Lemma 6.3. dofa, t s ) < |£ —s| V t, s r t (6(a)) < || [D, a]|| V £ and a £ „4 sa . 

Proof. dc(T t ,r s ) < |£ — s\ iff 

T t (a) - r s (a) < ||[£>,a]|| • |£ — s\ \/a£A sa . (6.4) 

By symmetry, we can assume that t > s. By linearity and continuity of r t : 

r t (a) - T s (a) = J ^T^(a)d( = t^(6(o)) d£ . 

If rc((5(a)) < || [D, a] || for all £ in the interior I of /, then 

T t( a ) ~ T s (a) < f || [D, a] ||d^ = || [D, a] || • |£ — s| 

J S 


(6.5) 
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for all t,s £ I, proving . If, on the other hand (6.4) holds for all t,s £ I and 
a € A sa , then by linearity and continuity of Tj: 

T t (S(a)) = ( lim T * +e(a) ~ Te(a) < ||[Aa]|| 

for all t £ I. By continuity we can replace I by I, proving “=>”. ■ 

Remark 6.4. Note that everything works even if A is not dense in A. But 
in this case the map 5(A) -A 5(A) is surjective but not bijective, and tq must be 
a state of A. In particular, given any unital spectral triple (A, TL,D) and taking 
A = 8(11), we can consider the action a t (a) = e ltD ae~ ltD ; the associated derivation 
is 6(a) = i[D,a] and automatically satisfies the condition of Lemma 6.3. 

Unfortunately, in order to prove Pythagoras equality the condition in Lennna 
6.3 is not enough. We need the following stronger assumption. 

Definition 6.5. A geodesic pair (ct|/,t 0 ) will be called strongly geodesic if 

T t ((Say(Sa) + b*b) < \\[D,a]*[D,a] + b*b\\ , V a,b € A sa ,t <E I. (6.6) 

Example 6 .6. The geodesic pair in Example 6.2 is strongly geodesic. Indeed, 
for all real functions a, b on M: 

T t ((5a)*(5a) + b*b) < Wg^d^ad^a + b 2 ^ = || [D, a\*[D, a\ + b*b\\ 

where in the first inequality I used (6.3), and in the last equality I used (6.2) and 
the observation that for functions the operator norm and the sup norm coincide. 


A noncommutative example is given by Moyal plane, which is recalled below. 

Lemma 6.7. Let (A, H®C 2 , D ) be an even spectral triple, with obvious grading, 
trivial action of A on the C 2 factor and 


D = 


0 £U 
D + 0 


(6.7) 


Let (cr|/,ro) be a geodesic pair. Assume 

5(a) = \ [uD + — uD _, o] , V a € A sa 
for some fixed u € U(l). Then (er|/, r 0 ) is strongly geodesic. 


PROOF. Since states have norm 1, it is enough to prove that 
|| (5a.)*(5a,) + b*b\\ < || [D, a]* [D, a] + b*b\\ 


for all a, b £ A sa . The left hand side of this inequality is the operator norm of 


5a 

0 

1 

up+,a\ 

0 

1 

+ TT 

— up-, a] 

0 

b 

0 

~ 2 

b 

0 

2 

b 

0 


From the triangle inequality: 


l.h.s. < 


1 


[D+,a\ 

b 


+ - 


P ,« 

b 


Jo [A,o]*[A,a]+6*6 
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On the other hand, from 


[D,a]*[D,a] +b*b = 


[D + ,a]*[D + ,a] + b*b 0 

0 [D^,a\*[D_,a\ + b*b 


we get 

jj [D,a]*[D,a\+b*b 
hence the thesis. 


max II [Di,a\*[Di,a] + b*b 
£{+,-} 


Example 6.8. Let ( A,TL,D ) be the even irreducible spectral triple of Moyal 
plane in [8, §3.3], given by A = <S(N 2 ) the algebra of rapid decay matrices on 
H = f“(N) ® C 2 with standard grading, and Dirac operator as in (6.7). Here D + 
is given on the canonical orthonormal basis of (, 2 (N) by: 

D+ | n) = y // f(n+ 1) |n + 1) , 

for all n > 0, Z)_ = ( D + )* and d > 0 is a deformation parameter. Every state To 
of A = /C(£ 2 (N)) is normal, To(a ) = {ifo\a\ipo) ■ For z £ C, we call 

T(z) := exp [\{zD + — zD_)} . 

The vectors \ip z ) := T(z) \ifo) give a family of normal states do, with z £ C. Fix 
u,v £ C with |u| = 1, and let 1 1 —>■ Zt = ut. + v be the corresponding line in C. Then 

t e^ u ^T{z t ) = e~ itx T(v) 

is a strongly continuous one-parameter group of unitaries generated by the (un¬ 
bounded, self adjoint on a suitable domain) operator 

X := 2 -(uD + - uD_) , (6.8) 

and (Jtfa) = T(zt)*aT(zt) is a strongly continuous one-parameter group of 
*-automorphisms generated by the derivation 6(a) = i[X,a]. In the notations above, 
ifr 0 := dtju, thenr t = 'F Zt for all t £ K; from [8, Prop. 4.3] we get du(T t ,T s ) = \t— s| 
for all (s £ R (the original proof is in [22]). So, (ct|j,to) is a geodesic pair; it 
follows from Lemma 6 .7 that it is also strongly geodesic. 

6.2. Products by C 2 . Let (A\,'H\,D\,')\) be the spectral triple of the two- 
point space in §4.1, (_4 2 , H 2 , D- 2 ) any unital spectral triples and (A, TL, D) the prod¬ 
uct spectral triple. Let (cr|/,r 0 ) be a strongly geodesic pair on the second spectral 
triple, and r 2 the corresponding state. Finally, let 

[—A, A] ■> <S(C 2 ) , t t) := FtA—t ^ _|_ FA—t ^ , 

be the map in (4.2). For x = (x±,X 2 ) € [—A, A] x / we define a product state 

t x ■= r Xl ® t 2 2 . 

Proposition 6.9. For all x,y, the states t x ,t v satisfy Pythagoras equality. 
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PROOF. Due to (3.6), it is enough to prove that 

d D (T x ,T y ) < d Dl (t^jT^) 2 + d D2 (t£ 2 ,Ty 2 ) 2 = \\x-y\\ 2 , (6.9) 

where the latter equality follows from the definition of geodesic pair. 

Let u G R 2 be the unit vector 

x-y 

U ~ \\x-vh 

and f a (t) := T ut+V (a ) (for all t s.t. U\t + yi G [—A, A]). For all a = (a-f, a±) G -4 sa : 

r\\ x ~yh 


T x{a) - T y (a) = f a (\\x - y\\ 2 ) - f a ( 0) = 


fa(t) d t < \\fa\\oo\\x - yh > ( 6 . 1 - 0 ) 


where /' is the total derivative of f a with respect to t. Using Cauchy-Schwarz 
inequality in R 2 we derive: 


( d , d ,\ 

/ 

d 

2 

d , . 

2 

(“•% r£( “ ) + “ 2 afe T « ,a) J 

- V 

£= ut+y V 

W, Te{> 

+ 

w/ f{a) 



(=ut+y 

An explicit computation gives 

^ r «( a ) = - H) > = T «(( id ® S ?)( a )) • 

Since r^(a) is linear in £ 1 , the supremum of |r|((id <g> <5 2 )(a)) | 2 over all £1 G [—A, A] 
is attained on one of the extremal points (is a degree 2 polynomial in £1 with 
non-negative leading coefficient). Hence: 

]T ? ((id® A 2 )(a))| 2 < max (|r| 2 (<5 2 a t )| 2 , |r| 2 (<5 2 a4.)| 2 } . 

For any state <p and any normal operator b, |<^(6)| 2 < <p(b*b) (Kadison-Schwarz 
inequality). Thus 

\.fai.t)\ 2 < max r| 2 ( Ci ) 

{T >4-/ 

where, for i G 

c-i := ( S 2 ai ) 2 + (2^u(a t - a;) 2 . 

From (6.6) we get 

\f a (t)\ 2 < .max ||[D 2 ,Oi]*[£> 2 ,Oi] + - aj.) 2 1| . 

\T 

On the other hand: 

In i*r n 1 = T [£ , 2 ,a t ]*[£> 2 ,a t ] + (2A)- 1 (a t -a 4 .) 2 

[ • [D 2l a l ]*[D 2 ,a l } + (2X)-\a t -a i )\ 

where the off-diagonal elements are omitted. By considering vectors in H = C 2 <g>'H 2 
with one component equal to zero, we get a lower bound on the norm: 

||[£>,a ]|| 2 > max ||[£> 2 ,a*]*[£> 2 ,a*] + 7 ^ 2(01 - a- 4 ,) 2 || > \f a (t)\ 2 . 


ieft.4-1 


Now (6.10) becomes r x (a) — r y (a) < ||[£>,a]|| • ||x — y\\ 2 , that is (6.9). 
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6.3. From two-sheeted to A'-shectod noncommutative spaces. Inspired 
by §5.3, we shall now see liow one can transfer results about Pythagoras from a 
product with C 2 to a product with an arbitrary finite metric space. 

Let {Ai, Hi, Di, 71 ) be the canonical spectral triple on a metric space with 
N points, introduced in §4.3, (A 2l TO, D 2 ) an arbitrary unital spectral triple, and 
(A, H, D) their product (3.1). Let (A', W , D') be the product of the spectral triple 
on a two-point space of §4.1 with the same spectral triple {A 2l R 2 , D- 2 ) above. Let 
(5|(a|, a.[.) = and <5j,(a-|-, 04 .) = a 4 . be the two pure states of C 2 . 


Proposition 6.10. Let <f 2 ,if 2 G S(A 2 ). If Pythagoras equality is satisfied by 
8 ) <p 2 and 8 if 2 in the product (A 1 , %’, D'), then it is satisfied by <p\ 8 ip 2 and 
ifi 8 if 2 in the product {A, H, D) for any two pure states <fi,ifi of Ai = C N . 


PROOF. We have a decomposition H = ©, #j C 2 8 P 2 , where on the summand 
( i,j ) the representation is given by 


for all a = (ai,...,ajv) € A = (^2)^ — C N 8 A 2 (each a,; here is a bounded 
operator on T-l 2 ). The Dirac operator is D = Dij, with 


1 

0 

1 

+ 

’ P>2 

0 

9ij 

1 

0 


0 

-d 2 


where each matrix entry is an operator on P 2 . Let = x r and ifi = x s be two 
pure states as in (4.7). If r = s, Pythagoras equality is trivial (it follows from 
Theorem 3.2(i) and Lemma 3.3, i.e. the fact that taking a product doesn’t change 
the horizontal resp. vertical distance). We can then assume that r ^ s. 

Since ||[D,a]|| = max,^ ||[Aj,7i'ij(o)]|| > ||[D rs , 7 r rs (a)]||, then 


d D (ip,ip) < d Drs (ip, if) = sup {gj 2 (a r ) - if 2 {a s ) : || [D rs , 7 r rs (a)] || < 1} . 


On the other hand, dD rs {(p,if) is the distance on a product of {A 2 ,Ti 2l D 2 ) with a 
two-point space (the two states x r and x s will give one and one <5j.), which by 
hypothesis is equal to 

\Jgfs + d D2 (<p 2 ,if 2 ) 2 = \Jd Dl ( x r , x a ) 2 + d D2 (<p 2 , if2) 2 ■ 


This proves dni^jif) < dn^ dD 2 ( l Pi if), the opposite inequality being always true, 
cf. (3.6). ■ 


Note that the states ip 2 ,if 2 in previous proposition are not necessarily pure. 

Corollary 6.11. Pythagoras equality is satisfied by pure states in the product 
of any Riemannian manifold with any finite metric space [23], and by translated 
states in the product of Moyal spectral triple with any finite metric space [22]. 
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7. A digression on purification 

Consider a product of unital spectral triples {A\, "Hi, D\, 71 ) and (A 2 , dii.Df). 
From (3.6): 

d Dl (y 1 ,V’l) < d D (ip,ip) 

for any states ip, ip with marginals <p\ = p\ and ip\ = ip\. When the second spectral 
triple is given by A 2 = 1-L 2 = C and D 2 = 0, for <p = p>\ (8> id and ip = ip 1 <8> id (the 
identity map is a state on C), the above inequality is clearly an equality. We can 
then say that: 

d Dl (<fii,ipi) = min d D (ip, ip) 

is a minimum over all the extensions p,ip of ipi,ip\ and over all unital spectral 
triples (A 2 , 7 ^ 2 , D 2 ). This may seem an overly convoluted way to look at the spec¬ 
tral distance, but establishes an analogy with the purified distance of quantum 
information, which is briefly discussed below. 

Given two states p>\ € <S(Ai) and tp 2 £ S(A 2 ), a purification is any pure state 
p £ <S(A! (8> A 2 ) with marginals p\ and p 2 . In Example 2.1, the Bell state is a 
purification of the tracial states Lp\ = p 2 = Trca(.). 

Let us focus on the example of matrix algebras, which is the one studied in 
quantum information. States are in bijection with density matrices, and in this 
section they will be identified. A state is pure if the density matrix p £ M/v(C) is 
a projection in the direction of a unit vector v of C N , i.e. p = vv*, where we think 
of v resp. v* as a column resp. row vector. 

In the matrix case, every state p has an almost canonical purification. Since p 
is positive, we can find an orthonormal eigenbasis {Viof C N and write 

\ - * 

P= 2 ^ 1 Pi v i v i , 

where is the probability distribution given by the eigenvalues of p. A 

purification is then the state p° £ M N (C) x M N ( C) given by 

p° = 2 _. \JV1V3 ViV* (8) ViV* , 

which is a projection in the direction of v° = Vlh Vi®Vi. It is almost canonical 

in the sense that it depends on the choice of eigenbasis. 

The purified distance d-p is usually defined in terms of Uhlmann’s fidelity, but 
the most interesting characterization is as the minimal trace distance between all 
possible purifications of the states [33, §3.4]. For p, r £ Mjy(C): 

dv{p, r) = inf \Jl - |(^,w )| 2 (7.1) 

where the inf is over all rank 1 projections p' = vv*,t' = ww* £ M N (C)<8>M N >(C), 
for all N' > 1, that have p and r as marginals: 


(id c « O Tr c „')(p') = P , (idc« ® Tr c „/)(r') = r . 
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Figure 5. Plot of (A.l). 


One can easily verify that the trace distance and the purified distance coincide on 
pure states. 

Example 7.1 (Bloch’s sphere). Consider the map B 3 —> S(M 2 ( C)), x K > p x , in 
%4-5. To compute the purified distance we use the formula dp(p, t) = ^/l — F(p, r) 
[33, Def. 3.8], together with the formula F(p,r) = Tr(pr) + 2-^/det p det r for the 
fidelity [37, Eq. (9.47)]. We get 

d v (p Vl p y ) = 2-^1- (x, y) - (1 - ||x|||) 2 (l - || 2/|||)2 . 

Ifx,y are unit vectors (pure states), with some algebraic manipulation one verifies 
that d-p is proportional to the Euclidean distance: dp(p y ,p y ) = ^||x — y\\ 2 . 


Appendix A. The surface A 1 x A 1 c A 3 


Since A 1 is a segment, from a metric point of view A 1 x A 1 with product 
metric is a square. We can plot its embedding in A 3 as follows. Vertices of A 3 are 
vectors {ei} 4 =1 of the canonical basis of R 4 ; let / : R 4 —> R 3 be the linear map 
sending the vertices of A 3 to alternating vertices of a cube, that is: /(ei) = (1,1,1), 
/(e 2 ) = (1, -1, -1), /(e 3 ) = (-1, 1 , -1), /(e 4 ) = (-1, - 1 , 1). 

With the obvious identification A 1 ~ [—1,1], the map from A 1 x A 1 to product 
states is 


[-1, l] 2 3 (f,») ~ P(M) := (<Wffi±£l, 


£ A 3 


Performing a parametric plot of 


(f,s) ^/op(f,s) = s ) 


(A.l) 


one gets the surface in Figure 5. One can imagine that the square is first bent along a 
diagonal, so that its two halves cover two faces of the tetrahedron 10 . Then it is bent 
by pushing its center, until it coincides with the barycenter of the tetrahedron 11 . 


10 Of course, one has to shrink the diagonal until its length matches the edges. 

14 The center (0,0) of the square is sent to (0, 0, 0) S IR 3 , which is the barycenter of /(A 3 ). 
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Note that this surface is just the intersection of a hyperbolic paraboloid (with 
equation 2 = xy) with the cube of vertices (±1, ±1, ±1), so in particular it is not an 
isometric deformation of the original square (by Gauss’s Theorema Egregium, since 
it is negatively curved). Thus, the geodesic distance induced by this embedding of 
A 1 x A 1 —> M 3 is not the product distance. On the other hand, the embeddings of 
A 1 x {s} and {£} x A 1 are isometric (these are the straight lines with constant s 
resp. t which are shown in the figure). The product metric is the pullback of the 
Euclidean distance on the base square in Figure 5 via the vertical projection. 

One can also check that the map ip —> p" from a state to the product of its 
marginals is just the vertical projection to the red-blue surface of the corresponding 
point in the tetrahedron in Figure 5. 
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